Drift Detection Method Using Distance Measures and Windowing Schemes for Sentiment Classification

نویسندگان

چکیده

Textual data streams have been extensively used in practical applications where consumers of online products expressed their views regarding products. Due to changes distribution, commonly referred as concept drift, mining this stream is a challenging problem for researchers. The majority the existing drift detection techniques are based on classification errors, which higher probabilities false-positive or missed detections. To improve accuracy, there need develop more intuitive that can identify great number drifts streams. This paper presents an adaptive unsupervised learning technique, ensemble classifier opinion and sentiment classification. performance, approach uses four different dissimilarity measures determine degree stream. Whenever detected, proposed method builds adds new ensemble. add classifier, total classifiers first checked if limit exceeded before with least weight removed from end, weighting mechanism calculate each decides contribution final results. Several experiments were conducted real-world datasets results evaluated false positive rate, miss accuracy measures. also compared state-of-the-art methods, include DDM, EDDM, PageHinkley support vector machine (SVM) Naïve Bayes frequently studies. In all cases, show efficiency our method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A revised Fuzzy - PROMETHEE method , using Fuzzy Distance and Similarity Measures

PROMETHEE refers to a collection of methods of ranking in the field of multi-criteria decision making. These methods are characterized by conceptual simplicity and practical applicability. However, the nature of phenomena involving decision-making in real world leads us to use fuzzy method of preference ranking. The most common criticism on mathematical ranking procedures is that they tend to d...

متن کامل

A Hierarchical Classification Method for Breast Tumor Detection

Introduction Breast cancer is the second cause of mortality among women. Early detection of it can enhance the chance of survival. Screening systems such as mammography cannot perfectly differentiate between patients and healthy individuals. Computer-aided diagnosis can help physicians make a more accurate diagnosis. Materials and Methods Regarding the importance of separating normal and abnorm...

متن کامل

New distance and similarity measures for hesitant fuzzy soft sets

The hesitant fuzzy soft set (HFSS), as a combination of hesitant fuzzy and soft sets, is regarded as a useful tool for dealing with the uncertainty and ambiguity of real-world problems. In HFSSs, each element is defined in terms of several parameters with arbitrary membership degrees. In addition, distance and similarity measures are considered as the important tools in different areas such as ...

متن کامل

Distance Measures for Prototype Based Classification

The basic concepts of distance based classification are introduced in terms of clear-cut example systems. The classical k-NearestNeigbhor (kNN) classifier serves as the starting point of the discussion. Learning Vector Quantization (LVQ) is introduced, which represents the reference data by a few prototypes. This requires a data driven training process; examples of heuristic and cost function b...

متن کامل

Correlation-Based Method for Sentiment Classification

The classic supervised classification algorithms are efficient, but time-consuming, complicated and not interpretable, which makes it difficult to analyze their results that limits the possibility to improve them based on real observations. In this paper, we propose a new and a simple classifier to predict a sentiment label of a short text. This model keeps the capacity of human interpret-abili...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computers, materials & continua

سال: 2023

ISSN: ['1546-2218', '1546-2226']

DOI: https://doi.org/10.32604/cmc.2023.035221